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4.5 0.1 - + 

NMA 0.4 - + 

51.3 1.0 - + 

54.7 1.4 - + 

NMA 0.3 - + 

7.5 0.1 - - 

35.4 0.7 - 

311 4.5 + + 

1534 1.4 + + 

NMA 0.5 - + 

0.1 0.3 - - 



1 .6 0.4 
20.8 0.5 
0.1 0.3 
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FIGURE 31A 



10 20 30 40 50 60 

I I III i 

1 AAGGGTCCTC CTTAGGCTGA ATGCTTGCAG ACAGGATGCT TOGTTACAGA TGGGCTGTCA 

TTCCCACGAG GAATCCGACT TACGAACGTC TGTCCTACGA ACCAATGTCT ACCCGACACT 



61 CTCGAGTGCA GTTTTATAAG GGTGCTCCTT AGGCTGAATG CTTGCAGACA GGATGCTTGG 
GAGCTCACCT CAAAATATTC CCACGAGGAA TCCGACTTAC GAACGTCTGT CCTACGAACC 



121 TTACAGATGG GCTGTGAGCT GGGTGCTTGT AAGAGGATGC TTGGGTGCTA AGTGAGCCAT 
AATGTCTACC CGACACTCGA CCCACGAACA TTCTCCTACG AACCCACGAT TCACTCGGTA 



181 TTGCAGTTGA CCCTATTCTT CGAACATTCA TTCCCCTCTA CCCCTGTTTC TGTTCCTGCC 
AACGTCAACT GGGATAAGAA CCTTGTAAGT AAGGGGAGAT GGGGACAAAG ACAAGGACCG 



24 1 AGCTAAGCCC ATTTTTCATT TTTCTTTTAA CTCCTTAGCG CTCCGCAAAA CTTAATCAAT 
TCGATTCGGG TAAAAAGTAA AAAGAAAATT GAGGAATCGC GAGGCGTTTT GAATTAGTTA 



3 01 TTCTTTAAAC CTCAGTTTTC TTATCTGTAA AAGGTAAATA ATAATACAGG GTGCAACAGA 
AAGAXATTTG GAGTCAAAAG AATAGACATT TTCCATTTAT TATTATGTCC CACGTTGTCT 



361 AAAATCTAGT GTGGTTTACA TAATCACCTG TTAGAGATTT TAAATTATTT CAGGATAAGT 
TTTTAGATCA CACCAAATGT ATTAGTGGAC AATCTCTAAA ATTTAATAAA GTCCTATTCA 



421 CATGATAATT AAA TGAAATA ATGCACATAA AGCACATAGT GTGGTGTCCT CCATATAGAA 
GTACTATTAA TTTACTTTAT TACGTGTATT TCGTGTATCA CACCACAGGA GGTATATCTT 



481 AATGCTCAGT ATATTGGTTA TTAACTACTT GTTGAAGGTT TATCTTCTCC ACTAAACTGT 
TTACGAGTCA TATAACCAAT AATTGATGAA CAACTTCCAA ATAGAAGAGG TGATTTGACA 



54 1 AAGTTCCACA AGCCTTACAA TATGTGACAG ATATTCATTC ATTGTCTGAA TTCTTCAAAT 
TTCAAGGTGT TCGGAATGTT ATACACTGTC TATAAGTAAG TAACAGACTT AAGAAGTTTA 



601 ACATCCTCTT CACCATAGCG TCTTATTAAT TGAATTATTA ATTGAATAAA TTCTATTGTT 
TGTAGGAGAA GTCGTATCGC AGAATAATTA ACTTAATAAT TAACTTATTT AAGATAACAA 



661 CAAAAA TCAC TTTTATATTT AACTGAAATT TGCTTACTTA TAATCACATC TAACCTTCAA 
GTTTTTAGTG AAAATATAAA TTGACTTTAA ACGAATGAAT ATTAGTGTAG ATTGGAAGTT 



721 A GAAAA CACA TTAACCAACT GTACTGGGTA ATGTTACTGG GTGATCCCAC GTTTTACAAA 
TCnTTGTGT AATTGGTTGA CATGACCCAT TACAATGACC CACTAGGGTG CAAAATGTTT 
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FIGURE 31B 

781 TGAGAAGATA TATTCTGGTA AGTTGAATAC TTAGCACCCA GGGGTAATCA GCTTGGACAG 
ACTCTTCTAT ATAAGACCAT TCAACTTATG AATCGTGGGT CCCCATTACT CGAACCTGTC 

841 GACCAGGTCC AAAGACTGTT AACAGTCTTC TGACTCCAAA CTCACTGCTC CCTCCAGTGC 
CTGGTCCACC TTTCTGACAA TTCTCAGAAG ACTGAGGTTT GAGTCACGAC CGAGGTCACC 

901 CACAAGCAAA CTCCATAAAG GTATCCTGTG CTGAATAGAC ACTCTAGAGT GGTACAAAGT 
GTGTTCGTTT GAGGTATTTC CATAGGACAC GACTTATCTC TGACATCTCA CCATGTITCA 

961 AAGACAGAGA TTATATTAAG TCTTAGCTTT GTGACTTCGA ATGACTTACC TAATCTAGCT 
TTCTGTCTGT AATATAATTC AGAATCGAAA CyiCTCAAGCT TACTGAATGC ATTAGATCGA 

102: AAATTTCAGT TTTACCATGT GTAAATCAGG AAGAGTAATA GAACAAACCT TGAAGGGTCC 
TTTAAAGTCA AAATGGTACA CATTTAGTCC TTCTCATTAT CTTGTTTGGA ACTTCCCAGG 

1081 CAATGGTGAT TAAATGAGGT GATGTACATA ACATGCATCA CTCATAATAA GrGCTCTTTA 
GTTACCACTA ATTTACTCCA CTACATGTAT TGTACGTAGT GAGTATTATT CACGAGAAAT 

1141 AATATTAGTC ACTATTATTA GCCATCTCTG ATTAGATTTG ACAATAGGAA CATTACGAAA 
TTATAATCAG TGATAATAAT CGGTAGAGAC TAATCTAAAC TGTTATCCTT GTAATCCTTT 

12 01 GATATAGTAC ATTCAGGATT TTGTTAGAAA GAGATGAAGA AATTCCCTTC CTTCCTGCCC 
CTATATCATG TAAGTCCTAA AACAATCTTT CTCTACTTCT TTAAGGGAAG GAAGGACGGG 

12 61 TAGGTCATCT AGGAGTTGTC ATGGTTCATT GTTGACAAAT TAATTTTCCC AAATTTTTCA 
ATCCA^TAGA TCCTCAACAG TACCAAGTAA CAACTGTTTA ATTAAAACGG TTTAAAAAGT 

1321 CTTTGCTCAG AAAGTCTACA TCGAAGCACC CAAGACTGTA CAATCTAGTC CATCTTTTTC 
GAAACGAGTC TTTCAGATGT AGCTTCGTGG GTTCTGACAT GTTAGATCAC GTAGAAAAAG 

1381 CACTTAACTC ATACTGTGCT CTCCCTTTCT CAAAGCAAAC TGTTTCCTAT TCCTTGAATA 
GTGAATTGAG TATGACACGA GAGGGAAAGA GTTTCGTTTG ACAAACGATA AGGAACTTAT 

1441 CACTCTGAGT TTTCTGCCTT TGCCTACTCA GCTGGCCCAT GGCCCCTAAT GTTTCTTCTC 
GTGAGACTCA AAAGACGGAA ACGGATGAGT CGACCGGGTA CCGGGCATTA CAAAGAAGAG 

1501 ATCTCCACTC GGTCAAATCC TACCTGTACC TTATGGTTCT GTTAAAAGCA GTGCTTCCAT 
TA6AGGTGAC CCACTTTAGG ATGGACATGC AATACCAAGA CAATTTTCGT CACCAAGGTA 



1561 AAAGTACTCC TAGCAAATCC ACCCCCTCTC TCACCGATTA TAAGAACACA CTTTAtTTTA 
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FIGURE 31C 



TTTCATGAGG ATCGTTTACG TGCCGGAGAG AGTGCCTAAT ATTCTTGTGT CAAATAAAAT 



1621 TAAAGCATGT AGCTATTCTC TCCCTCGAAA TACGATTATT ATTATTAAGA ATTTATAGCA 
ATTTCGTACA TCGATAAGAG AGGGAGCTTT ATGCTAATAA TAATAATTCT TAAATATCGT 



1681 GGG AT AT AAT TTTGTATGAT GATTCTTCTG GTTAATCCAA CCAAGATTGA TTTTATATCT 
CCCTATATTA AAACATACTA CTAAGAAGAC CAATTAGGTT GGTTCTAACT AAAATATAGA 



1741 ATTACGTAAG ACAGTAGCCA CACATAGCCG GGATATGAAA ATAAAGTCTC TGCCTTCAAC 
TAATGCATTC TGTCATCGGT CTGTATCGGC CCTATACTTT TATTTCAGAG ACGGAAGTTG 



1801 AAGTT^CCACT ATTCTTTTCT TTCCTCCCCT CCCCTCCCCT CCCTTCCCCT CCCCTTCCTT 
TTCAAGGTCA TAAGAAAAGA AAGGAGGGGA GGGGAGGGGA GGGAAGGGGA GGGGAAGGAA 



1861 CCCTTTCCCT TCCCTTCCTT TCTTTCTTGA GGGAGTCTCA CTCTGTCACC AGGCTCCAGT 
GGGAAAGGGA AGGGAAGGAA AGAAAGAACT CCCTCAGAGT GAGACAGTGG TCCGAGGTCA 



1921 GCAGTGGCGG TATCTTGGCT GACTGCAACC TCCGCCTCCC CGGTTCAAGC GATTCTCCTG 
CGTCACCGCG ATAGAACCGA CTGACGTTGG AGGCGGAGGG GCCAAGTTCG CTAAGAGGAC 



1981 CCTCAGCCTC CTGAGTAGCT GGGACTAQAG GAGCCCGCCA CCACGCCCAG CTAATTTTTG 
GGAGTCGGAG GACTCATCGA CCCTGATGTC CTCGGGCGGT GGTGCGGGTC GATTAAAAAC 



204 1 TATTTTTAGT AGAGATGGGG TTTCACCATG TTGGCCAGGA TGGTCtCGAT TTCTCGACTT 
ATAAAAATCA TCTCTACCCC AAAGTGGTAC AACCGGTCCT ACCAGAGCTA AAGAGCTGAA 



2101 CGTGATCCGC CTGTCTGGGC CTCCCAAAGT GCTGGGATTA CAGGCGTGAG CCACCACGCC 
GCACTAGGCG GACAGACCCG GAGGGTTTCA CGACCCTAAT GTCCGCACTC GGTGGTGCGG 



2161 CGGCTTTAAA AAATGGTTTT GTAATGTAAG TGGAGGATAA TACCCTACAT GTTTATTAAT 
GCCGAAATTT TTTACCAAAA CATTACATTC ACCTCCTATT ATGGGATGTA CAAATAATTA 



2221 AACAATAATA TTCTTTAGGA AAAAGGGCGC GGTGGTGATT TACACTGATG ACAAGCATTC 
TTGTTATTAT AAGAAATCCT TTTTCCCGCG CCACCACTAA ATGTGACTAC TGTTCGTAAG 



2281 CCGACTATGG AAAAAAAGCX; CAGCTTTTTC TGCTCTGCTT TTATTCAGTA GAGTATTGTA 
GGCTGATACC TTTTTTTCGC GTCGAAAAAG ACGAGACGAA AATAAGTCAT CTCATAACAT 
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2341 GAGATTGTAT AGAATTTCAG AGTTGAATAA AAGTTCCTCA TAATTATAGC AGTGGAGAGA 
CTCTAACATA TCTTAAAGTC TCAACTTATT TTCAAGGAGT ATTAATATCC TCACCTCTCT 
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FIGURE 31 D 



24 01 GGAGAGTCTC TTTCTTCCTT TCATTTTTAT ATTTAAGCAA GAGCTGGACA TTTTCCAAGA 

CCTCTCAGAG AAAGAAGGAA AGTAAAAATA TAAATTCGTT CTCGACCTGT AAAAGGTTCT 

24 61 AAGTTTTTTT TTTTTAAGCC GCCTCTCAAA AGGGGCCGGA TTTCCTTCTC CTGGAGGCAG 

TTCAAAAAAA AAAAATTCCG CGGAGAGTTT TCCCCGGCCT AAAGGAAGAG GACCTCCGTC 

2 521 ATGTTGCCTC TCTCTCTCCC TCCGATTGGT TCAGTGCACT CTAGAAACAC TGCTGTGGTG 

TACAACGGAG AGAGAGACCG AGCCTAACCA AGTCACGTCA GATCTTTGTG ACGACACCAC 

2 5S1 GAGAAACTGG ACCCCAGGTC TGGAGCGAAT TCCAGCCTGC AGGGCTGATA AGCGAGGCAT 

CTCTTTGACC TGGGGTCCAG ACCTCGCTTA AGGTCGGACG TCCCGACTAT TCGCTCCGTA 

2641 TAGTGAGATT GAGAGAGACT TTACCCCGCC GTGGTGGTTG GAGGGCGCGC AGTAGAGCAG 

ATCACTCTAA CTCTCTCTGA AATGGGGCGG CACCACCAAC CTCCCGCGCG TCATCTCGTC 

2 7 01 CAGCACAGGC GCGGGTCCCG GGAGGCCGGC TCTGCTCGCG CCGAGATGTG GAATCTCCTT 

GTCGTGTCCG CGCCCAGGGC CCTCCGGCCG A3ACGAGCGC GGCTCTACAC CTTAGAGGAA 

2761 CACGAAACCG ACTCGGCTGT GCCCACCGCC CGCCGCCCGC GCTGGCTGTG CGCTGGGGCG 

GTGCTTTGGC TGAGCCGACA CCGGTGGCGC GCGGCGGGCG CGACCGACAC GCGACCCCGC 

2821 CTGGTGCTGG CGGGTGGCTT CTTTCTCGTC GGCTTCCTCT TCGGTAGGGG GGCGCCTCGC 

GACCACGACC GCCCACCGAA GAAAGAGGAG CCCAAGGAGA AGCCATCCCC CCGCGGAGCG 

28 81 GGAGCAAACC TCGGAGTCTT CCCCGTGGTG CCGCGGTGCT GGGACTCGCG GGTCAGCTGC 

CCTCGTTTGG AGCCTCAGAA CGGGCACCAC GGCGCCACGA CCCTGAGCGC CCAGTCGACG 

2941 CGAGTGGGAT CCTGTTGCTG GTCTTCCCCA GGGGCGGCGA TTAGGGTCGG GGTAATGTGG 

GCTCACCCTA GGACAACGAC CAGAAGGGGT CCCCGCCGCT AATCCCAGCC CCATTACACC 



3 001 GGTGAGCACC CCTCGAG 
CCACTCGTGG GGAGCTC 
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FIGURE 32 

Potential binding sites on tne PSM promoter* 
Site Seq •'Location #nt matched 



AP1 TKAGTCA 1145 7/7 

E2-RS ACCNNNNNNGGT 1940 12/12 

1951 12/12 

GHF NNNTAAATNNN 580 11/11 

753 11/11 

1340 11/11 

1882 11/11 

1930 11/11 

1979 11/11 

2001 11/11 

2334 11/11 

2374 11/11 

2591 11/11 

2620 11/1 1 

2686 11/11 

. JVC repeat GG3NG3RR 8/8 

r-.-5 8/8 

1 1 80 8/8 

1185 8/8 

119: 8/8 

NFkB GGGRHTYYHC 95^ • 10/10 

uteroglobi RYYWSGTG 250 8/8 

92- 8/8 

1104 8/8 

IFN AAWAANGAAAGGR590 13/13 Cell 41:509 (1985) 



• the PSM promoter sequence 683XFRVS (Fig. 1) starts from the 5* end of the 
promoter fragment. The 3" region overlapps the previously publishied PSM cDNA at 
nt#2485.i.e. the putatative transcription start site is at nt#2485 on sequence 
683XFRVS. "The number refered to in this table is in reference to sequence 
683XF107 which is the complement and inverse of 683XFRVS. 
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FIGURE 39 

10 20 30 40 50 60 

I i 11 I I 

1 TTTGCAGACT TGACCAACTT TCTAAGAAAA GCAGAACCAC ACAGGCAAGC TCAGACTCTT 
AAACGTCTGA ACTGGTTGAA AGATTCTTTT CGTCTTGGTG TGTCCGTTCG ACTCTGAGAA 

61 TTATTAAATT CCAGTTTTGA CTTTGCCACT TCTTAGTGGC CTTGAACAAG TTACCGAGTC 
AATAATTTAA GGTCAAAACT GAAACGGTGA AGAATCACCG GAACTTGTTC AATGGCTCAG 

121 CTCTCAGCGT TAGTTACCCT ATTTTAATGA TGAGGATAAT ATTATCTGCC CAAATTATTG 
GAGAGTCGCA ATCAATGGGA TAAAATTACT ACTCCTATTA TAATAGACGG GTTTAATAAC 

161 GTATAGTAAA TATATAGCAT GTAAATCTCC TAGCAGAGTA CTGGGATTTC GCCACTTTAT 
CATATCATTT ATATATCGTA CATTTAGAGG ATCGTCTCAT GACCCTAAAG CGGTGAAATA 

241 TTCTTCTTTA CCAAGATACT CCTATTGGAC TTAATACACA GGACTAGTCT AAGGTATCAC 
AAGAAGAAAT GGTTCTATGA GGATAACCTG AATTATGTGT CCTGATCAGA TTCCATAGTG 

3 01 CAGGTAGTCC ACTCCTGCTC GGAATCTGAC CCGGGATTAG AGTAGGGCAT GGACCAGATG 
GTCCATCAGG TGAGGACGAG CCTTAGACTG GGCCCTAATC TCATCCCGTA CrTGGTCTAC 

3 61 GGTTTAAACA AATTCAATAT CTTCCACTAG CTTCACCTTG GGGTTGTAAA AGTTTTrGAA 
rCAAATTTGT TTAAGTTATA GAAGGTGATC GAAGTGGAAC CCCAACATTT TCAAAAACTT 



42 1 rrACACACTG TGCTCATAAC AATCTTCATC TCTTAAAAGG ATTTTATTCT TCCTGGTATC 
ZZZZZGTGAC ACGAGTATTG TTAGAAGTAG AGAATTTTCC TAAAATAAGA AGGACCATAG 

'4 81 CTCACTCTCA TCCCTTGTAT TCCGTGCTCA GTGGCTGACA CAGAAGAGTT CTTTATKNNN 
GAGTGAGAGT AGGGAACATA AGGCACGAGT CACCGACTGT GTCTTCTCAA GAAATANNNN 

54 1 NMNNKKKNNN CATCCTGTTC ATTTTTCAGA TCTCAGTTCA AGCATCTCGT CCTCAGTGTG 
NNT^KNNNNNN GTAGGACAAG TAAAAAGTCT AGAGTCAAGT TCGTAGAGCA GGAGTCACAC 

601 GTGTTKNCTG ATCCCTCACT CTAATCCAAG TCTTTCTGTT TTATGCACAG GTTGGAATCT 
CACAANNGAC TAGGGAGTGA GATTAGGTTC AGAAAGACAA AATACGTGTC CAACCTTAGA 

661 TATTTCCGTT TGCGNNCCAA TCNAATNGTA TTTAATATGC ATGTATATAT GTATGTGCAT 
ATAAAGGCAA ACGCNNGGTT AGNTTANCAT AAATTATACG TACATATATA CATACACGTA 



721 TTGTATGCTA NGCGATTAAG AACTAGAATA ATTAATAATT CGAAGTCTAG AAGTGG 
AACATACGAT NCGCTAATTC TTGATCTTAT TAATTATTAA CCTTCAGATC TTCACC 
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FIGURE 40A 

10 20 30 40 .50 60 

1 T GAAAAA TAC ATCAAAAATA GGCATCAGAT ACGAGCCTAT AGATAGGAJT" TA-~-T'~~rA- 
ACTTTTTATG TAGXriTTAT CCGTACTCTA TGCTCGGATA TCTATCCTCA AtXaAAAATA 

TJtTTSr^^* TGTATTATTT CTAAAACACA AATTATCAAT ATTACCTCTG ACATTAGGTC 
ATAACAACAT ACATAATAAA CATTTTGTGT TTAATAGTTA TAATgSgAC T^JHt^cI? 

121 AGATATTCTG , AATTTTAATT TCTCTTGCCT ACTTTCACTG AAAAAGAGT'' ATCJCa 

TCTATAAGAC TTAAAATTAA AGAGAACGGA TGAAAGTGAC TTTTTCTCAG TACGTTTGTC 

iirnr^^^t^r rSSf^^^^^^-^ ttgcaaaata tttttttatc caacttcaat gatacgtatt 

TAAAAATTCA ACGTTTGGTT AACGTTTTAT AAAAAAATAG GTTGAAGrTA CTATCCATAA 

241 GCTGTTAATT CTAAGATATG CATTAATTGT TTCAACTAAT GGGTGTCAAA CGAGATGT-r 
CGACAATTAA CATTCTATAC GTAATTAACA AAGTTGATTA CCcicAGTTT GCTctI^AAg 

rS^t^T™ GgC AAAAA GG AGATCCACCT TCTArTTTCA TAAAGTTTCT ATCTTCCTC 
ACTxi^ACTT CCGTTTTTCC TCTACGTGGA AGATGAAAGT ATTTCAAA3A TAGAAGGAGA 

3 61 GCTGACTCAA ATAAGCATTT AATACATTTT ATAACGAATT AJi.TTi-GA^- A-^A-^rra*. 
CGACTGAGTT TATTC"A_:.^ T7ATGTAJUUV TArTGCTTAA TTAArAC~A TAtSa™ 

421 taaataaatt atttccaagt gttgaaggaa attcagactt ctaatttgct ctga-^c-c- 

ATTTATTTAA TAAAGGnCA CAACTTCCTT TAAGTCTGAA GArrAAA?Gl GActAISct 
^^ ^^^^ ^ AATGCTCTGT GAGAGTTTGC GTTTCCAGTG A-A3TAGCGTG AGAAATCCAA 

ttgattttgt ttacgagaca ctctcaaacg caaaggtcac ttcAtcgcac tct?^I§g?? 

•541 GTCAGACAGC TACATGAAAC TACATTTArr AGCTCTCTGC CAr-ACACCAG TGCACGATAG 
CAGTCTGTCG ATGTACTTTG ATGTAAATGG TCGAGAGACG. GTCTGTGGTC ACGTCctItc 

GTAGCTAGAT CTCAGTCATA GCTNNNNNNN NKNNNNNNNN AGACCTTGCA 
GCGTCTTGTA CATCGATCTA GAGTCAGTAT CGANNNNNNN NNNNNNNNNN TCTGGAACGT 

661 CTTGGCTTTT AACCTGAAGG AGATAAGGCA AGATTCCAGG GTTTATTTAG AGAAATTACA 
CAACCGAAAA TTGGACTTCC TCTATTCCGT TCTAAGGTCC CAAATAAATC TCTTtII^GT 

721 GGATCTGGGA ATAAAGTAGT TACAAAATTA GTCCCCAACC AGCTTTCATG GAGCTTTCAA 
CCTAGACCCT TATTTCATCA ATGTTTTAAT CAGGGGTTGG TCGAAAGTAC CTCGAAACTT 
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FIGURE 40B 



781 TTATTAATTA TTCTAGTTCT TAATCGCATG CATACAATGC ACATACATAT ATACATGCAT 
AATAATTAAT AAGATCAAGA ATTAGCGTAC GTATGTTACG TGTATGTATA TATGTACGTA 



641 ATTAAAATAC ATGATTGGAC GCAAACGGAA ATAAGATTCC ACCTGTGCAT AAAACAGAAA 
TAATTTTATG TACTAACCTG CGTTTGCCTT TATTCTAAGG TGGACACCTA TTTTGTCTTT 



901 GACTTGGTTA GAGTGAGGGA TCAGGAAACA CCACACTGAG GACGAGATGN NI.'NNKNNNNN 
CTGAACCAAT CTCArTCCCT AGTCCTTTGT GGTGTGACTC CTGCTCTACN NNNNNNNNNN 



961 NTAGTGGGTG GGGGGCGGAC ATCAATAAAG AACTCTTCTG TGTCAGCCAC TGAGCACGGA 
NATCACCCAC CCCCCGCCTG TAGTTATTTC TTGAGAAGAC ACA3TCGGTG ACTCGTCCCT 



:C21 ATAAAGGGAT GAGAGTGAGG GCAANTACCA GAAGAATAAA ATCCTTTTAA GAGATGAAGA 
TATTTCCCTA CTC7CACTCC CGTTNATGGT CTTCTTATTT TAGGAAAATT CrrTA>TrCT 



1081 TTGTTATGAG CACAGTGTGT GGKTTCAAAA ATCTTTTAAC AACCCCAAGG TGAAGCTAGT 
AACAATACTC GTGTCACACA CCNAAGTTTT TAGAAAATTG TTGGGGTTCC ACTTCGATCA 



1141 TGGAAGATAT TTGAATTTGT TTAAACCCAT CTGGTCCTAG CCCTATTCTT TGAATCCGAA. 

AcrrrcTATA AACTTAAACA AArrrGGCTA gaccaggatc gggataagaa acttaggctt 



1201 GAGGTCAAGA ATTCCGAGCA GA3TGGACTA CCTGTGATAC CTTAGACTAG TCCTGTGTAT 
CTCCAGTTCT TAAGGCTCGT CTCACCTGAT GGACACTATG GAATCTGATC AGGACACATA 



1261 TCAAGTCCAA TGAGAGTATC TGTAAGAGAA TAAGTGCGAA ATCCAGATCT 
AGTTCAGGTT ACTCTCATAG ACATTCTCTT ATTCACGCTT TAGGTCTAGA 
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FIGURE 41 



20 30 40 50 60 

I i I I t 

1 GGATTCTGTT GAGCCCTAGC TCATTATGAT GTCCTGTTGT CCTACCCAAA TAAGACTCAT 
CCTAAGACAA CTCGGGATCG AGTAATACTA CAGGACAACA GGATGGGTTT ATTCTGAGTA 



€1 CCCAACTACA TCTCAATAAT TAATGAAGAT GGAAATGAGG TAAAAAATAA ATAAATAAAT 
GGGTTGATGT AGAGTTATTA ATTACTTCTA CCTTTACTCC ATTTTTTATT TATTTATTTA 



121 AAAAGAAA CA TTCCCCCCCA TTTATTATTT TTTCAAATAC CTTCTATGAA ATAATGTTCT 
TTTTCTTTGT AAGGGGGGGT AAATAATAAA AAAGTTTATG GAAGATACTT TATTACAAGA 



181 ATCCCTCTCT AAATATTAAT A GAAA TCAAT ATTATTGGAA CTGTGAATAC CTTTAATATC 
TAGGGAGAGA TTTATAATTA TCTTTAGTTA TAATAACCTT GACACTTATG GAAATTATAG 



2A1 TCATTATCCG GTGTCA^CTA CTTTCCTATG ATGTTGAGTT ACTGGGTTTA GAAGTCGGGA 
A-TAATAGGC CACAGTTGAT GAAAGGATAC TACAACTCAA TGACCCAAAT CTTCAGCCCT 



3 CI AATAATGCTG TAAANKNNNK AGTTAGTCTA CACACCAATA TCAAATATGA TATACTTCTA 
TTATTAC^AC ATTTKNNNNN TCAATCAGAT GTGTGGTTAT ACTTTATACT ATATGAACA7 



3 61 AACCTCCAAG CATAAAAAGA GATACTTTAT AAAAGAGGTT CrTTTTTTTCT TTT''I'": " :''ri"I" l 
TTGGAGGTTC GTATTTTTCT CTATGAAATA TTTTCTCCAA GAAAAAAAGA AAAAAAAAAA 



4:: TCCAGATGGA GTTTCArTcc tgtcaggca:. GCNGAGTGCA GTGGTGCCAT CTCGGCTCAC 
AGGTCTACCT CAAA3TGAGG ACAGTCCGTC CGNCTCACGT CACCACGGTA GAGCCGAGTG 



.431 TGCAACCTCr ACCTCCCATG TTIAAGGGAT TCTCCTTCCT CAGTCTCCTG AGTAGCTGGG 
ACGTTGGAGG TGGAGGGTAC AAGTTCCCTA AGAGGAAGGA GTCAGAGGAC TCATCGACCC 



54 1 ATTACAGGTG TGCACCACCA CACCCAGCTA AnTTTGTAT TTTTAATAGA GACAGGGTTT 
TA^ivTGTCCAC ACGTGGTGGT G7GGGTCGAT TAAAAACATA AAAATTATCT CTGTCCCAAA 



601 CGATCGATGT TGGCCAGGCT AGTCTCGAAC TCCTGACCTC TAGGTGATCC ACCCGCTCAG 
GCTAGCTACA ACCGGTCCGA TCAGAGCTTG AGGACTGGAG ATCCACTAGG TGGGCGAGTC 



661 CTCCCAAAGT TGTAGAATTA CACGTGTGAG GCACTGCGCC TTGCCAGGAG ATACATTTTT 
GAGGGTTTCA ACATCTTAAT GTGCACACTC CGTGACGCCG AACGGTCCTC TATGTAAAAA 



721 GATAGGTTTA ATTTATAAAG ACACTGCACA GATTTGAGTT GCTGGGAAAT GCACGGATTC 
CTATCCAAAT TAAATATTTC TGTGACGTGT CTAAACTCAA CGACCCTTTA CGTGCCTAAG 



781 CAGTATGCA 
GTCATACGT 
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FIGURE 43A 



10 20 30 40 50 60 

I i I I I - ■ ! 

1 TATGGGAAAG TTTTCAGAGG AAATAAGGTA AGGGAAAAGT TATCTCTTTT TTTCTCTCCC 
ATACCCTTTC AAAAGTCTCC TTTATTCCAT TCCCTTTTCA ATAGAGAAAA AAAGAGAGGG 

61 CCAATGTAAA AAGTTATAGT GGGTTTTACA TGTGTAGAAT CATTTTCTTA AAACTTTATG 
GGTTACATTT TTCAATATCA CCCAAAATGT ACACATCTTA GTAAAAGAAT TTTGAAATAC 

121 AATACCATTA TTTTCTTGTA TTCTGTGACA TGCCACCTTA CAGAGAGGAC ACATTTACTA 
TTATGGTAAT AAAAGAACA7 AAGACACTG7 ACGGTGGAAT GTCTCTCCTG TGTAAATGAT 

181 GGTTATATCC CGGGGTTAAA TTCGAGCATT GGAATTTGGG CAGTGTAGAT GTTTAGAGTG 
CCAATATAGG GCCCCAATTT AAGCTCGTAA CCTTAAACCG GTCACATCTA CAAATCTCAC 

241 AACAGAAO.i TTTTTCTGTG CZZkZ.^ZZ" ATGGCTGTGG CGTA :.\.=.3AA GCATGCACTG 
7TGTCT7GTT AAAAAGACAC GAA7 3TrrAA TACCGACACC GCATGTTCTT CGTACGTGAC 

2 01 GGTTTATTAT TAACTTTCAG TATCTTTGTT ~AAA7Am TrTACAAAAA TGTT7ACTAA 
CCAAA7AA7A ATTGAAAGTC A7AGAAACAA AA777A7AAA AGATGTTTTT ACAAATGA77 

361 ATTAAATTG7 AG7ATG.iu5..r7 GTTA7AAA7A ' A7GAG3 " AAA CATTTACACA 7AGCAAATTT 
7AATTrAACA 7CA7ArrTAA CAA7ATT TAT 7AC7CCC7TT GTAAA7G7G7 A7CG7T7AAA 

42 1 AAAAATTACr 37CATTTGAT 77GT7AA7AT ATT7TTCTC7 77A3TGGGAA ATTAAA77AA 
7TTTTAA7GA - CA37AAACTA A.^. r-AA77A7A TAA-AAAGAGA AA7CACCCTT 7AA7TTAA77 

.4 61 AAAAT7CC77 7C:.ArTCTCA GACAJk7AGGA TTGCTG7GG7 C7AC7TGC77 ATTA7ATTTG 
7T77AAGGAA AGC7GACAG7 CTG77A7CCT AACGACACCA GATGAACGAA 7AA7ATAAAC 

54 1 7AGAGTCrAG AA7GCAA7CT CAC7ACAC7A 7AGACA7C7C ANNCTAACG7 AGGACAA7TC 
A7CTCAGA7C TTACG7TAGA G7GA7GTGA7 A7C7G7AGAG 7yNGATTGCA TCCTG7TAAG 

601 7GAGAAAC7A 77CCAGACCT CC77A7GGGC T7AGCCAAGG KTATCC7TCA GC7GGCAT7G 
ACTCTTTGA7 AAGGTCTGGA GGAA.7AGCCG AA7CGGTTCC NA7AGGAAGT CGACCGTAAC 



661 CAGGGTGACT TC7NCCTCNN AA7CCAGC7C 7C7NTCACAG A7GTGA7CCA AGAGACACTC 
GTCCCACTGA AGANGGAGNN TTAGGTCGAG AGANAGTGTC TACACTAGGT TCTCTGTGAG 



721 ACAATTAA7C AACTAGCATT C7AAATTTCA ATTCCAGATC TATTACCTTA ATATGGTAGC 
TGTTAATTAG TTGATCGTAA GATTTAAAGT TAAGGTCTAG ATAATGGAAT TATACCATCG 
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FIGURE 43B 



TSl TGAAGCTT7N K7CACTGTCA ATTCTGATCA GATATATGAC AATTTTAAAT TATTTGCAGT 

ACTTCG/lAAN NAGTGACAGT TAAGACTAGT CTATATACTG . TTAAAATTTA ATAAACGTCA 

64 1 GTGTAAGAAA CGCTTCAGGT AGTTTAAATT TAAGGCT 

CACATTCTTT GC3AAGTCCA TCAAATTTAA ATTCCGA 
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FIGURE 44A 

10 20 30 40 50 60 

i I I 1 i 

1 CTCCTTTGGC CCCTGCCAGC TGGGCATTTT TAACCTAGTT TACACAGTGT CTTTTTTTCC 

GAGGAAACCG GGGACGGTCG ACCCGTAAAA ATTCGATCAA ATGTGTCACA GAAAAAAAGG 

€1 TTATTTTAAA TTGGTTGTTC CAGATTCGGT AATATCAATT TTTAATATTA . CACTTAAATG 

AATAAAATTT AACCAACAAG GTCTAAGCCA T7ATAGTTAA AAATTATAAT G7GAATTTAC 

121 AGTACCAG.^A CTTTATCTTC AACCTTTTTC TCATTAGGCC TACAACATAG GACATCTCGG 

TCATGGTCrT GAAATAGAAG TTGGAAAAAG AGTAATCCGG ATGTTGTATC CTGTAGAGCC 

181 ATAGAATTTC CTTTTCTTTT TGC7ACTATA AGCTGCTAAA ATCCTCAGAA CATCAGATTT 

TATCTTAAAG GAAAAGAAAA ACGATGATAT TCGACGATTT TAGGAGTCTT GTAGTCTAAA 

241 AGAAATGTTC TTATTAGTGG TAGTGAGCAT TTGCTATTTC CTACCACTAG CTTACAAATA 

TCTTTACAAG AATAATCACC ATCACTCGTA AACGATAAAG GATGGTGATC GAATGTTTAT^ 

2C-1 lAATAAGCAA GTAGACCCCA CAGCCCAAAT TCCTATTTGT TCTACAGTCG AAAGGGAATT 

ATTATTCGTT CATCTGGGGT GTCCGGTTTA AGGATAAACA AGATGTCAGC TTTCCCTTAA 

TTTTAAAATT TAATTTCCAC TAAAGAGAAA AATATATTAA CAATCAAATT GACAGTCGAT 

AAAATTTTAA ATTAAAGGTG ATTTCTCTTT TTATATAATT GTTAGTTTAA CTGTCAGCTA 

TTTAATTr-rT AT3T3TA«i~ GTTTTCCCTC ATTATTTATA ACAATTCATA CTACAATTTA 

AAATTA^CGA TArATATTA-i. rA.AAAGGCAr- TAATAAATAT TGTTAAGTAT GA7GTTAAAT 

4S: A777AG7A.^-\ CA77777G7A GAC:ATA7T7 AAAACAAAGA 7ACTGAAAGT 7AATA7AAAC 

7AAA7CA777 GZkA^^^J-.ZAZ C7GG7A7AAA 7TTTG777C7 A7GAC7TTCA ATTA7A777G 

141 rrAG7GCA7G C7C7C7G7AG GCCACAGCCA 7AACCTG7AA GCACAGAAAA AT7TGTTCTG 

:-:-7CACG7AC GAGAGACA7C CGr.737CGG7 ATTCGACATT CGTGTCTTTT TAAACAAGAC 

€01 77AC7CTAAA CATC7ArArr GGCCAAAT7C CA;^.7GCTCGA ATTTAACCCC GGGATA7AAC 

AA7GAGATTT- G7AGA7:7GA CCGGTTTAAG GT7ACGAGCT TAAATTGGGG CCCTATATTG 

661 C7A37AAA7G 7G7CC7CTCT G7CAAGG7GG GCA7G7CACA GAA7ACAGAA CAA7CAATGG 

GA7CA7T7AC ACAGGAGAGA CAGTTCCACC CG7ACAGTGT CTTATGTCTT GTTAGTTACC 



721 TATTCA7AAA GT7TTAAGAA AA7GATTCTA CACATGTAAA ACCCACTATA ACTTTTTACA. 
ATAAG7ATTT CAAAATTCTT TTACTAAGAT GTCTACATTT TGGGTGATAT TGAAAAATGT 
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FIGURE 44B 



781 TTGGGGGAGA GAAAAAAAGA GATA^iTTTTT ACCTTACCTT ATTTCC7CTG AAAACTTTCC 

AACCCCCTCT CTTTTTTTCT CTATTA.\AAA TGGAATGGAA TAAAGGAGAC TTTTGAAAGG 

5 4 1 rATATCTGC-r AATTA.rAATT TTCCGAGAGC AATTGATTTT CATGTCCCZ-T ICC 

GTATAGACCG TTAATGITAA AAGGGTrTC:- TTAACTAAAA GTACAGGGCA AGG 
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FIGURE 45A 



10 . 20 30 40 50 6' 

I . I I I , 

1 GATGCTATTT GGGCAATTTC TTATTGACAG TTTTGAAATG TTAGGCTTTT ATCTCCA''^ 

CTACGATAAA CCCCTTAAAG AATAACTGTC AAAACTTTAC AATCCGAAAA 7AGAGGTAAA 



61 TTTAGTACTT AAATTTTCCA ACATGGGTGT TGCTTGTTAT TTTATCAGTA TAAAATAGAA 
AAATCATGAA TTTAAAAGGT TGTACCCACA ACGAACAATA AAATAGTCAT ATTTTATCTT 



121 GAGTGGTTCT GTTCTGGAAT TTAGTATATA CATGAGTATC TAGTGTATGT CAGCCATGAA 
CTCACCAAGA CAAGACCTTA AATCATATAT GTACTCATAG ATCACATACA GTCGGTACTT 



181 AATGAACCTT TCAGATGTTT AACTTCAGGG AACCTAATTG AGTCATTGCT CCAGACATTG 
TTACTTGGAA AGTCTACAAA TTGAAGTCCC TTGGATTAAC TCAGTAACGA GGTCTGTAAC 



241 TTGCTTTGA.i CCCACTATAT TKKKKNNNCT CGGGCAATr^A CTCAGTGTGG CA.2vGGATACT 

AACGAAACTT GGGTGATATA A-VNNNNNNGA GCCCGTTACT GAGTCACACC GTTCCTATGA 

301 ACTGCAGGCC TGTTTCTGGA AGGCACTGGA CTCCTCTGAT GCAAACTTTG GCCAGGGACT 

TGACGTCCGG ACAAAGACCT TCCGTGACCT r-A3GAGACTA CGTTTGAAAC CGGTCCCTGA 



3 61 CCTTGATAGC TCTT AAAT AG ATGCTGCA CC AACACTCTCT TTCTTTTCTC TCTTTTTCTT 

GGAACTATCG AGA^.TTTATC TACGACGTGG TTGTGAGAGA AAGAAAAGAG AGAAAAAGAA: 

i 

421 TATTCAATAT TAGACTACAA CCAZZZZ.-J-.: GA"TCTCAG GGTTTCTAGC TCTCTCTCAT 

ATAAGTTATA ATCTGATGTT CGTrAGATTT TTISAAGAGTC CCAAA3ATCG AGAGAGAGTA 



4S: TTCACACATG CTTTCCTAGT AATCTCTACT CATATATCTT ACTGCTACGC TGGGGCCAGA 
AAGTGTGTAC GAAA.GGA7CA TTAGAGATGA GTATATAGAA TZAZGATGCG ACCCCGGTCT 



541 TAACNKKNNN CTTCCATTTT GTZZZZAZCZ CTATTCTTCT 7CCCCTTCTG CTTTCATTAT 

ATTGNNNNNN GAAGGTAAAA CAAAAATAGA GATAAGAAGA AGGGGAAGAC GAAAGTAATA 

601 TGAAACTTTC TGCTTTCATT ATTGAAACTT TCCCAGATTT GTTCTGCTTA ACCTGGCATT 

ACTTTGAAAG A CG AAAGTAA TAACTTTGAA AGGGTCTAAA CAAGACGAAT TGGACCGTAA 



661 GGAACTGTTT CCTCTTCCCT GTGCTGCTTT CTCCCATTGC CATGTCCTTT. TTTTTTTTTT 
CCTTGACAAA GGAGAAGGGA CACGACGAAA GAGGGTAACG GTACAGGAAA AAAAAAAAAA 



721 TTTTTTTTTT TGAGACAGTG TCACTCTGTT GCCCAGGCTG GAGTGCAATG GTGCAATCTT 
AAAAAAAAAA ACTCTGTCAC AGTGAGACAA CGGGTCCGAC CTCACGTTAC CACGTTACAA 
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FIGURE 45B 



781 GGCCACTGCA ACCCCGACTC CGGGTTCAAG TGATTCTCTA CCTGCCTCAG CCTCCTGAGT 
CCGGTGACGT TGGGGCTGAG GCCCAAGTTC ACTAAGAGAT GGACGGAGTC GGAGGACTCA 

54 1 AGC7GGGATT ACACGTGCCA CCACTATGCC GGCTGATTTT CTATTTTAGT AGAGATGGGT 
TCGACCCTAA TGTCCACGGT GGTGATACGG CCGACTAAAA CATAAAATCA 7CTCTACCCA 

9C1 TCACATGCAG ATCAGCTGTT CCGACTCTGA CCAGKTWNKN NNNNNNNWNN ATCAAAGTCA 
A3T37ACG7C TAGTCGACAJ^. GGCTGAGACT GGTCNNKKKN NNKSNNNNNN TAGTTTCAGT 

9€1 GCCAAAGTGC TAGGCTTAGA GTAATTGTGT AATTTCCACA CAAGTGCAAC CTAGTGTAAT 
Cr-CTTTCACG ATCCGAATCT CA7TAACACA TTAAAGGTGT GTTCACGTTG GA7CACA77A 

i::: ZZZZZAAZ^J^ TGTNNK7ATG AA7G7CTCGA ACG77AG7AA CTAA7AACAA G7AG77AGTT 
C3GAGT7CTT ACA.VNNA7Ar 77ACAGAGC7 7C-CAATCATT GATTATTGTT CATCAA7CAA 

108 1 7A7AGA7G7A 7CCTA37A7G 7AGCA 
ATA7C7ACA7 AGGA7CA7AC A7CG7 
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FIGURE 46A 



10 20 30 40 50 60 

I I I I I 

1 CACAAAAAAA GATTATTAGC CACAAAAAAA CCTTGAACTA ACGCATTAAA ATGTTAATGG 
.GTGTTTTTTT CTAATAATCG GTGTTTTTTT GG AACTTCAT TGCGTAATTT TACAATTACC 



61 ATTCACTTTA TTGAGCATCT GCTCATAATA CTTTAATGAG TGCAAAGTGC TTTGAATATA 
TAAGTGAAAT AACTCGTAGA CGAGTATTAT GAAATTACTC ACGTTTCACG AAACTTATAT 



121 ATACGTCATT TAAACCTTAC CATAATTCTG AGGAATTGCT ACCTCCACTT CACAGATCGG 
TATGCAGTAA ATTTGGAATG GTATTAAGAC TCCTTAACGA TGGAGGTGAA GTGTCTACCC 



181 GCACAGGAGG CTTAGATAAC ATGCCCAAAG TCATGCTTCT AGTAAATGGA TATAATTAAG 
CGtGTCCTCC GAATCTATTG TACGGGTTTC AGTACGAAGA TCATTTACCT ATATTAATTC 



241 ATTCAAATTA TTGATAAGAA TTTGATCTGC :7TArrA:;rA TCTAGTAGTA AATCTAAAAG 
TAAGTTTAAT AACTATTCTT AAACTAGACG GAATCGTCAT AGATCATCAT TTAGATTTTC 



3 01 CGCTTTCCAG AGCATGTGCT GTTGATA3A3 CTTGATGTCT AACTCTCTGA AATTTTCCA7 
GCGAAAGGTC TCGTACACGA CAACTAICTC GAACTACAGA TTGAGAGACT TTAAAAGGTA 



3 61 TCTTATTTGT CTCACTGGTA TATAGTTATT TTTTACTACT TTCATACACC TA'CTAAGAAG 
AGAATAAACA GAGTGACCAT ATATCAATAA AAAATGATGA AAGTATGTGG ATGATTCTTC 



421 ACAGGAGGAT CAAAGATAGG ATTTCATTTA 3AATGCCTAA AGCTTCACGT ATTTTAATTC 
TGrCCTCCTA C-TTTCTATCC TAAAGTAAAT CTTACC-GATT TCGAAGTGCA TAAAATTAAG 



481 AGAATAAGAT TCAGGCAGAC CACCAGTATA ZZZZXZZZTC CCTGGTTATC TTTCAGCAGG 
7CTTATTCTA AGTCCGTCTG GTGGTCATAT ACGGTACCAG GGACCAATAG AAAGTCGTCC 



£4 1 TGACCGAGAA AGAAAACATG GTAATGTTTA TGAAATGGTG GGTTCTTGTA GTTTCACTTC 
ACTGGCTCTT TCTTTTGTAC CATTACAAAT ACTTTArCAC CCAAGAACAT CAAAGTGAAG 



601 AACATATCTG CCTTTACTGT ATTAAGATGA TGGATTAACT TATTCTTGAT ATGGGCATGT 
TTGTATAGAC GGAAATGACA TAATTCTACT ACCTAATTGA ATAAGAACTA TACCCGTACA 



661 AAAACAATAT ACTTTTACTA AACAGCTACA GAGAGACAAA TGTGTTTCCA GACAAACTTA 
TTTTGTTATA TGAAAATGAT TTGTCGATGT CTCTCTGTTT ACACAAAGGT CTGTTTGAAT 



721 AGAGACTGAG TGTTCAAACT GAATAATCTC GACCTTAATT GTAACTATAT TTTATGAAAT 
TCTCTGACTC ACAAGTTTGA CTTATTAGAG CTGG AATTAA CATTGATATA AAATACTTTA 
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FIGURE 46B 



781 CCAGCTGTAA GGCAAAACAG ACTCTTGGCT ACACGGCATT 7GTCTGTTAA TGATACTCAA 
GGTCGACATT CCGTTTTGTC TGAGAACCGA TGTGCCGTAA ACAGACAATT ACTATGAGTT 

g., .-;:<rTAACCG7 CACTTAATAA TGCTGAATAA TG7CATTAAT CT3AGATGTT AGTATGATCA 
GGAATTGGCA GTCAATTATT ACGACTTATT ACAGTAATTA GACTCTACAA TCATACTAGT 

rll ATG3 3AATCA CTCCTGAGCT CTCtJAAGCCC 
TACCCTTAGT GACGACTCGA CAGCTTCGGG 
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FIGURE 49 




647 bp 



234 bp 



SUBSTITUTE SHEET (RULE 26) 
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FIGURE 51 
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FIGURE 52 
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FIGURE 57 

Prostate Specific Promoter: 
Cytosine Deaminase Chimera 
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FIGURE 58A 

.10 2C 30 40 50 6C- 

1 GCGCCTTAAA «AAAA«™- . -wT.t.«AM ^-^^G AGAACGAATT TATATTTTTA 

CCGGAATTT TTTTTTTTTG A<--Aj«Awwa* i.Av./»v» 

K^r^r-^rrr'T CCTC- CA CTCCTATAAT TATGAGGAAC TTTTATTCAA 

?^^^CC^?T T^????S§A C-Gl§ISAaaT GAGGATATTA ATACTCCTTG AAAATAAGTT 

ISSS^I^S lal^SI?^! S^?^?o . 

I^I-- 

^ T-ii'-AA ACATGTAGGG TATTATCCTC CACTTACATT 

2-- --^r-II-Ur AArA;^-TATT TGTACATCCC ATAATAG-Aw «.--~v.oxAA 

— r-^'^rACG'*^ rGTAATCCCA GCACTTTGGG 

7:TT^-2C^^S §?-cgca??a ::-Ba§tgcg3 ^cattagggt cgtgaaaccc 

, , . ^1 --.i^i-.TCGAG AAATCGAGAC CATCCTGGCC P-.i.rATGGTGA 

----- ^llSrEi-:;;: -ZZ-.-l-^r. ac^rC^GCXC TTTAGCTCTG GTAGGACCGG TIGTACCACT 

_ i'^T - * """^GGCGT GGTGGCGGGC TCCT-* . AGT*- . 

i-:E:Ei2T:: 'i:tl::^~ I-Stt'ttt^. tI-gacccgca ccaccgcccg aggacatcac 

--- — TC-iu^C CGGGGAGGCG GAGGTTGCAG 

= - EE-i-EETcETE cEECXir^c- "-^--cTrTT agcg^cttg gcccctccgc ctccaacgtc 

— --G-GACAG AGTGAGACTC CCTCAAGAA>. 

5AI TCACrCAAGA - '■■•f^^-^zZZ"- tZ^llltZ' gACCACTGTC TCACTCTGAG GGAGTTCTTT 
AGTCGGTTCT ATCGC3GTGA ^^.--«-w-C_- «A^w«---.w 

^^.^nrr^x^r - - - "-^^G " GGAGGGGAAG GGAGGGGAGG GGAGGGGAGG 
601 GAA.AC.r.>^GG GAAGGGAAAw --^Cl^;: CCTCCCCTTC CCTCCCCTCC CCTCGCCTCC 

-r-— 'GAAG-- OGAGACTTT ATTTTCATAT CCCGGCTATG 
^^^=?^?T tI^G^CC^G .^C^2i?CC G^CTCTGAAA TAAAAGTATA CGGCCCATAC 

^^.^..^x xTi.-^-'^-JLA AATCAATCTT GGTTGGATTA ACCAGAAGAA 
l^>SiTJ^K SS5«C« .TiiS?A?i~ "lo^^AOAA CCAA=CTA*T TOCTCnCIT 
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FIGURE 58B 

7fil TCASAXCATA TATTCTCCTX ACTTCAATXC TTXCCXCCCA CCS5TAATCA OCTTCCikCXC 
IcTCTTCTAT ATAACACCAT TCAACTTATG AATCCTGGOT CCCCATtACT COAACCtOTC 

CXCCAOGTCC AAAGACTGTT AACAGTCTTC TCACTCCAAA CTCACtCCTC CCPCCACTCC 
CTOCTCCACG TTTCTGACAA TTCTCAGAAC ACTGAGCTT7 6ACTCACCAO CCACCTCACe 

901 CACAXGCAAX CTCCATAAAO GTATCCTGTG CTOAATACAC ACTGTAGAOT CCTACAAA6T 
CTGTTMTTT ScCTATTTC CATAOOACAC CACTTATCTC TGACATCTCA CCATCTTtCA 

961 AACACASACA TTATATTJU^O TCTTA8CTIT CTGACTTCCA ATCACTTACC TAATCTAOCT 
TTCTCTCTCT AATAIAATTC ACAATCCAAA CACTCAAOCT TACTQAA3C6 ATTACATCGA 

lOai AAATTTCAGT TT7ACCATCT CTAAATCAOC AA6ACTAATA GAACJJJJ^^ TOJIJJOOCTCC 
TTXAAAOTCX AAATCCtACA CATTTACTCC TTCTCATtAT CTTCTTTOOA ACTTCCCACG 

loai CAATOGTGAT TAAATCAC67 CATCTACATA ACATGCAICA CTCATAATAA OTCCTCXTtA 
CTTXcScTA ATTTACTCci CTACAT«TAT TCTACCTAGT aAOTATTATT CAOGAfiAAAX 

1X41 AATACTAGTC ACTATTATTA OCCATCTCTG ATTAGATTTC ACAATAGCAA CAWAOGAAA 
TTATAATCAG TGATAAtAAT CCGTACAGAC TAATCTAAAC TCTTATCCTT GTAATCCTW 

•tni ax-rxrt.G'TXC ATTCACGATT TTGTTACAAA GAGATCAA3A AATTCCCTTC CTTCCTCCCC 
5J;tSStc ^IIcTCCtIa licAATCTTT CTCTACTTCI TTAAOCCAAG GAAOaACCCG 

-AGGTCATCT ASGAGTrCTC ATGCTTCATT CTTCACAAAT TAATTTTCCC AAATTTTTCA 
;TCCA«IS TCCTCaIcAS TACCAAOTAA CAACTCTTTA ATrAAAACee TITAAAAAOr 

1321 CTTTGCTCAO AAACTCTACA TCGAAGCACC CAAGACT6TA CAATCTACTC CATCTTTTTC 
GAAACCACTC TTrCASATCT AOCTTCGTSS CTTCTCACAT CTTAGATCAG CTAGAAAAAC 

•nai -pAe— TAJLCTC A-AC-STOCT CTCCCrTTCT CAAAOCAAAC TCTTTGCTAT tCCTTGAATA 
W^??^G Ti«i=Al5i SIgGSAAAGA GTrTCCTrTC ACAAACCATA A6GAACITAT 

\A-r i^A'-TC-XlAGT TTTCTOCCrr ICCCTACTCA 3CTGGCCCAT CGCCCCTAAT GlVlCiTCTC 
5tSSm[ SiGiSSii icCGATGACT C3ACCGCGTA CCCCCOATTA CAAACAA6A6 

15151 ATCTCCAC-G GGTCAAATCC TACCTGTACC TTATGGTrCT GTTAAAAOCA 6WCTTCCAT 
TACACGTcic CcicTTTAGG ATGOACATOG AATACCAASA CAATTTTCCT CACGAAGCTA 

1561 AAACTACTCC TAGCAAATSC ACGSCCTCTC TCACGGATIA TAA3AACACA OTTTATTTTA 
rrrcAT«ACG ATCCTTTACG TGCCGGAGAG AGTGCCIAAT attcttctct caaataaaat 

V^l^^ ^^^^ ^^^^ ^ISS^iS ^IS^SSg^ 

^vsi^i ss^isi vz^c s^iissi ssi=c^3ss mis 

^S^^l SSiSS^ Sc^Ii^ ^I^I^ Si^SS ISSJSS 

isss issss 2:^^^ ^^^^ -ssss 

§SSI§§S ISSSToSi ISSSS^^ fc^i SSSSS ^SSiS 
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FIGURE 58C 

1921 CCACTCCCCC TATCTTOOCT CACTOCAACC TCCCCCTCCC CCGTTCAACC CAT7CTCCTG 
CCTCACCCCG ATAGAACCGA CTCACCTTSG AOCCCOAOCC GCCAAGTTCC CTAAGAGGXC 

1981 CCTCAOCCTC CTCACTAOCT CCCACTACAC CAGCCCCCCA CCACCCCCAC CTAATTTTTC 
&GACTCCGAG GACTCATCCA CCCT6ATGTC CTCCOCCGCT GCTGCGOQTC OATTAAAXAC 

2041 TATTTTTAGT AGACATGCGG TTTCACCATC TTCCCCACCA TOCTCTCCAT ""^GACTT 
ATAAAAATCA TCTCTACCCC AAAGIGCTAC AACCCCTCCT ACCACAOCTA AAGACCTGAA 

2101 CCTCATCCGC CTCTCTOGCC CTCC CAAACT GCTOCGATTA CAOOCGTOAC CCACCACOCC 
CCACTACCC6 SACAGACCCC GAOOCTTTCA CGACCCTAAT CTCCGCACTC CCTGCTCCCC 

2161 CCCCTTTAAA AAATCCTTTT GTAATGTAAG TGCXCCATAA TACCCTACAT CTTTATTAAT 
CCCOAAATTT TTTACCAAAA CATTACAXTC ACCTCCTATT ATGCCATGTA CAAATAATTA 

2221 XACAATAATA TTCTTTAGGA AAAAOOGCCC GGTCGTCATT -iSifrCATG J^AWCATTC 
TTGTTATTAT AAOAAATCCT TTTTCCCCCC CCACCACTAA ATGTGACTAC TCTTCCTAAG 

2281 CCCACTATGG AAAAAAAGCC CACCTTTTrC TGCTCTGCTT TTATTCAOTA «[Ji2Tiin5Ii 
CGCTCATACC TrrTTTTCCC CTCGAAAAAG ACCACAOOAA AATAACTCAT CTCATAACAT 

2341 GACATTGTAT ACAAmCAG AOTTGAATAA AACTTCCTCA ^AATTAtAGG AGTCCAGACA 
CTCTAACATi^ TCTTAAACTC TCAACTTATT TTCAA0GA6T ATTAATATCC TCACCTCTCT 

3401 CGAGACTCTC TTTCTTCCIT TCATTTTTAT ATTTAAOCAA OACCTCCACA TTTTCCAAGA 
CCTCTCAGAC AAAGAAGGAA ACTAAAAATA TAAATTCOTT CTCGACCTOT AAAACGTTCT 

2 4 61 AACTTTTTTT TTTTTAACCC GCCTCTCAAA AGGCGCCGGA TTTCCTTCTC CTCGAGGCAC 
TT2AAAAAAA AAAAA7TCCG CSCAGACTTT TCCCCCGCCT AAA6GAA0AG OACCTCCCTC 

2 521 ATGTTGCCTC TCTCTCTCGC TCCCATTGCT TCAOTCCACT CTA CAAACAC TCCTGTGGTG 
7ACAACGGAC ACAGACAGCC ACCCTAACCA A5TCACCTGA CATCTTTOTO ACQACACCAC 

2581 CAGAAACTGG ACCCCACGTC TGGACCGAAT TCCACCCTOC ACOOCTCATA ACCOACOCAT . 
CTCTTTCACC TCGGCTCCAC ACCTCCCTTA AOCTCGCACG TCCCGACTAT TCGCTCOGTA 

2 641 TACTGAGATT GACAGAOACT TTACCCCCCC CTCCTGGTTO GAGGCCGC6C AGTAGAGCA6 
ATCACTC7AA CTCTCTCTGA AATGGOGGOG CACCACCAAC CTCCOGOOCC TCATCTCOTC 

2 701 CA5CACACGC CCGCGTCCCG COAGGCCCGC TCTGCTC6CG C00AGAT6T0 GAATCTCCTT 
CTCCTCTCCS CGCCCAOCGC CCTCCCCCCG AGACCAOCOC GGCTCTACAC CTTA6ACCAA 

2761 CACOAAACCC ACTCGGCTGT CCCCACCCCC CGCCOCCCOC CCTGOCTGTG CCCTGCCOCC 
CTGCrXTGOC TGAOCCOACA CCCOTCCCOC OCCCOOGGCG OGACOCACAC GCCACCCCOC 

2821 CTGGTOCTGG CCOOTCOCTT CTTTCTCCTC CGCTTCCTCT TCOOTAOGGC OGCGCCTOGC 
GACCACGACC GCCCACCOAA GAAAGAGCAC CCGAAOGAGA AGCCATCCCC CCGO06AGOC 

2881 OGAGCAAACC TCCGACTCTT CCCCCTGGTG COGCOGTOCT GOOACTOGCO GGTCACCTGC 
CCTCCTTTGG AGCCTCAGAA GGOGCACCAC OOCCCCACGA CCCTGAGCGC CCAOTCGACG 

2941 CGAOTCOGAT CCT6TTCCTC OTCTTCCCCA OGCOGGGCGA TTAGGGTCGG CCTAATOTGG 
CCTCACCCTA OOACAACGAC CAOAAGGOGT CCCCGCCOCT AATCCCA6CC CCATTACACC 

3001 OCTOAOCACC CC7COAG 
CCACTCGTGG GGAOCTC 
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FIG. 76A 



XJUSGTXAXAATTXTCTCTTTTTTTCTCTCCCCCAATGTAXXAXOTTATAO - 

HMiiiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 'uutji' jiimmi'Miu 

JUiGCTXAJUUlTTXTCTCTTrTTTTCTCTCCCCCAXTCTXAAAAGTTATAO - 
T<KK3TTTTA<a^TGTGTA«XXTCATTTTCTTAAAACOTATC - 
TCOCTTTTACATCTCTAGAATCATTTTCTTAAAACTTTATGAATACCAW - 
ATTTTCrTGTATTCTCTCACATCCCCACCTTACACAOASGACAC^ - 

uimmmTiiniTmTTim 

lAOeTOATATCCCaSOOCWiUulCT 

ATCTTTAGACTWUiCXGAACAAATTTTTCTGTCCTrACAOC™ 

llllililllitilllillinjIMIIII) 111111111111 
ATCTOTAGAOTGAAolcAAcA^ 

TCOCCTACAAGAAGCATGCACTCGGTTTATTATCAAOT 

1 1 1 III ill II 1 1 1 II 1 1 111 1 1 III 1 1 1 Ml 1 1 JJJJliil I iiJJl 

GTTrrTAAATATTTTCTACAAAAATCmXCT 

iTTiTTMmmMmirmmmmm^^ 

attcitatJJuItJUItcacgc^^ 

. TTACTGTCATITCATTTGTTAATATACTT^ 

I II 1 1 1 1 1 1 II 1 1 1 11 II 1 1 1 1 1 1 1 1 1 H 1 1 1 1 1 liUM iiJl I I 
. ctaciotcJLWtgaJ^tg^ 

. ATTTTAAAAAATTCCCrrrCGACTGTAGAACAAATAGQAATTTGGCCTGT 
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FIG. 76B 

lllllllllillilllllllMiilllMlllllliliililMMIII) 

- GGCCTCTXCTTGCTTXTTATATTTCTAXGCTACTCCTAGGAAXTXGCAAA - 

- TOCTCACTACCACTAATAAGAACATTTCTAAATCTOATCTO - 

lilllllllMilllllllilllllll Ml - lliillliiiilil ill 

- TCCTCACTACCyiCTJuLTAACAAC^ - 



_ TTTACACCTTATACTACCAAAAACAAAAOGOAAATTCTATCCOAfiATCTC - 

IMIIIIIIIIilllllliliiilMlillliillllMIIIMillltl 

- TCTACACOTATACTJ^dLA^^ - 

- CTTTGTTCTACWCCTAATCACiAAAACGTTGAAGATAAAGTTCTGGTACTC - 

- tTiTn?iTTnTtni imirnimtmnTiTi inm I - 

- ATTrUGTCTAATArrGAA^ " 



TTAAAATAAGGAAAGAAAGACACTGTGTTTTCT - 
TTAAAATAAGCAAAGAAACACACTGTGTTTTCT - 
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